In-car speech recognition using model-based wiener filter and multi-condition training

نویسندگان

  • Masanori Tsujikawa
  • Takayuki Arakawa
  • Ryosuke Isotani
چکیده

This paper presents in-car speech recognition using a modelbased Wiener filter (MBW) and multi-condition (MC) training. The MBW is a 2-step denoising algorithm based on both rough and precise estimation of speech signals. Correcting roughly estimated signals with a Gaussian mixture model (GMM) makes it possible to accurately denoise with little computational cost. In an evaluation of in-car speech recognition, training of both a GMM and a back-end hidden Markov model (HMM) was performed using both studio-recorded speech signals as well as those signals mixed with in-car noise signals that were recorded in real car environments. In-car speech signals for testing were recorded with a plurality of microphones in different car environments. With respect to word accuracy obtained with MCtrained HMM, it was confirmed that the MBW with MC-trained GMM outperformed the Noise Reduction in ETSI advanced front-end.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Unified Approach of Compensation and Soft Masking Incorporating a Statistical Model into the Wiener Filter

In this paper, we present a new single-channel noise reduction method that integrates compensation and soft masking into the same statistical model assumptions for noise-robust speech recognition. By utilizing a Gaussian mixture model(GMM) as a pre-knowledge of speech and added noise signals, the proposed method can effectively restore clean speech spectra and separate out ambient noises from a...

متن کامل

A Multichannel Feature Compensation Approach for Robust ASR in Noisy and Reverberant Environments

In this paper we propose a multichannel feature compensation approach for automatic speech recognition in reverberant and noisy environments. The proposed technique propagates the posterior of the clean signal estimated by a multichannel Wiener filter in short-time Fourier transform (STFT) domain into Mel-frequency cepstrum coefficients (MFCC) domain. The multichannel Wiener filter reduces both...

متن کامل

Speaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation

A variety of methods are used for speaker adaptation in speech recognition. In some techniques, such as MAP estimation, only the models with available training data are updated. Hence, large amounts of training data are required in order to have significant recognition improvements. In some others, such as MLLR, where several general transformations are applied to model clusters, the results ar...

متن کامل

Speaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation

A variety of methods are used for speaker adaptation in speech recognition. In some techniques, such as MAP estimation, only the models with available training data are updated. Hence, large amounts of training data are required in order to have significant recognition improvements. In some others, such as MLLR, where several general transformations are applied to model clusters, the results ar...

متن کامل

Robust Speech Recognition for Adverse Environments

As the state-of-the-art speech recognizers can achieve a very high recognition rate for clean speech, the recognition performance generally degrades drastically under noisy environments. Noise-robust speech recognition has become an important task for speech recognition in adverse environments. Recent research on noise-robust speech recognition mostly focused on two directions: (1) removing the...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008